Musical Mosaicing with High Level Descriptors

نویسندگان

  • John O'Connell
  • Perfecto Herrera
  • Jordi Janer
چکیده

This thesis investigates the use of high level descriptors (like genre, mood, instrumentation, singer's gender, etc.) in audio mosaicing, a form of data driven concatenative sound synthesis (CSS). The document begins by discussing the advances made in the eld of music content description over the last 10 years, explaining the meaning of high level music content description and highlighting the relevance of automatic music content description in general, to the eld of audio mosaicing. It proceeds, tracing the origins of mosaicing from its beginnings as a time consuming manual process, through to modern e orts to automate mosaicing and enhance the productivity of artists seeking to create mosaics. The essential components of a mosaicing system are described. Existing mosaicing systems are dissected and categorised into a taxonomy based on their potential application area. The time resolution of high level descriptors is investigated and a new hierarchical framework for incorporating high level descriptors into mosaicing applications is introduced and evaluated. This framework is written in Python and utilises pure data as both user interface and audio engine. Descriptors, stemming from Music Information Retrieval (MIR) research are calculated using an in-house analysis extraction tool. In-house audio-matching software is used as the similarity search engine. Many other libraries have also been integrated to aid the research, in particular Aubio for note detection, and Rubberband, for time stretching. The high level descriptors included in this project are; mood (happy, sad, relaxed or happy), gender (male or female), key, scale (major or minor), instrumental, vocal. A mini application for augmenting audio loops with mosaics is presented. This is used to show how the framework can be extended to cater for a given mosaicing paradigm. The musical applications of mosaics in the traditional song-based composition are also explored. Finally, conclusions are drawn and directions for future work postulated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Augmenting Sound Mosaicing with Descriptor-driven Transformation

We propose a strategy for integrating descriptor-driven transformation into mosaicing sound synthesis, in which samples are selected by taking into account potential distances in the transformed space. Target descriptors consisting of chroma, mel-spaced filter banks, and energy are modeled with respect to windowed bandlimited resampling and mel-spaced filters, and later corrected with gain. The...

متن کامل

Automatic Timbral Morphing Of Musical Instruments Sounds By High-Level Descriptors

The aim of sound morphing is to obtain a result that falls perceptually between two (or more) sounds. In order to do this, we should be able to morph perceptually relevant features of sounds instead of blindly interpolating the parameters of a model. In this work we present automatic timbral morphing techniques applied to musical instrument sounds using high-level descriptors as features. High-...

متن کامل

Cognition-inspired Descriptors for Scalable Cover Song Retrieval

Inspired by representations used in music cognition studies and computational musicology, we propose three simple and interpretable descriptors for use in midto high-level computational analysis of musical audio and applications in content-based retrieval. We also argue that the task of scalable cover song retrieval is very suitable for the development of descriptors that effectively capture mu...

متن کامل

Ringomatic: A Real-Time Interactive Drummer Using Constraint-Satisfaction and Drum Sound Descriptors

We describe a real-time musical agent that generates an audio drum-track by concatenating audio segments automatically extracted from pre-existing musical files. The drum-track can be controlled in real-time by specifying high-level properties (or constraints) holding on metadata automatically extracted from the audio segments. A constraint-satisfaction mechanism, based on local search, selects...

متن کامل

Training-based Semantic Descriptors modeling for violin quality sound characterization

Violin makers and musicians describe the timbral qualities of violins using semantic terms coming from natural language. In this study we use regression techniques of machine intelligence and audio features to model in a training-based fashion a set of high-level (semantic) descriptors for the automatic annotation of musical instruments. The most relevant semantic descriptors are collected thro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011